Multifunction Thesaurus For Russian Word Processing

نویسنده

  • Igor A. Bolshakov
چکیده

A new type of thesaurus for word processing is proposed. It comprises 7 semantic and 8 syntagmatic types of links between Russian words and collocations. The original version now includes ca. 76,000 basic dictionary entries, 660,000 semantic and 292,000 syntagmatic links, English interface, and communication with any text editor. Methods of delivery enriching are used based on generic and synonymous links.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluation experiments on related terms search in Wikipedia: Information Content and Adapted HITS (In Russian)

The classification of metrics and algorithms search for related terms via WordNet, Roget’s Thesaurus, and Wikipedia was extended to include adapted HITS algorithm. Evaluation experiments on Information Content and adapted HITS algorithm are described. The test collection of Russian word pairs with human-assigned similarity judgments is proposed.

متن کامل

Word Association Thesaurus As a Resource for Building WordNet

The goal of the present paper is to report on the on-going research for applying psycholinguistic resources to building a WordNet-like lexicon of the Russian language. We are to survey different kinds of the linguistic data that can be extracted from a Word Association Thesaurus, a resource representing the results of a largescaled free association test. In addition, we will give a comparison o...

متن کامل

Sociopolitical Thesaurus in Concept-based Information Retrieval

In CLEF2005 experiments we used bilingual Russian-English Sociopolitical thesaurus that we constructed for more than 10 years specially as a tool for automatic text processing in information-retrieval tasks. The same resource and the same algorithm were used for ad-hoc and domain –specific tasks.

متن کامل

RuThes Linguistic Ontology vs. Russian Wordnets

The paper describes the structure and current state of RuThes – thesaurus of Russian language, constructed as a linguistic ontology. We compare RuThes structure with the WordNet structure, describe principles for inclusion of multiword expressions, types of relations, experiments and applications based on RuThes. For a long time RuThes has been developed within various NLP and informationretrie...

متن کامل

ارائه روشی برای استخراج کلمات کلیدی و وزن‌دهی کلمات برای بهبود طبقه‌بندی متون فارسی

Due to ever-increasing information expansion and existing huge amount of unstructured documents, usage of keywords plays a very important role in information retrieval. Because of a manually-extraction of keywords faces various challenges, their automated extraction seems inevitable. In this research, it has been tried to use a thesaurus, (a structured word-net) to automatically extract them. A...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994